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Abstract 

Approximation algorithms for classical constraint satisfaction problems are one of the main re- 
search areas in theoretical computer science. Here we define a natural approximation version of 
the QMA-complete local Hamiltonian problem and initiate its study. We present two main results. 
The first shows that a non-trivial approximation ratio can be obtained in the class NP using product 
states. The second result (which builds on the first one), gives a polynomial time (classical) algo- 
rithm providing a similar approximation ratio for dense instances of the problem. The latter result is 
based on an adaptation of the "exhaustive sampling method" by Arora et al. [ AKK99 1 to the quantum 
setting, and might be of independent interest. 



1 Introduction and Results 

In the last few years, the quantum analog of the class NP, the class QMA MKSV02II . has been exten- 
sively studied, and several QMA-complete problems have been found BLiu061 IBra061 ILCV071 IBS071 
IRos091 UGLTOI ISV091 rWMNlOI . Arguably the most important QMA-complete problem is the fc-local 
Hamiltonian problem HKSV021 IKR03I IOTfJ8l IKKR06I IAGIK091 . Here, the input is a set of Hamilto- 
nians (Hermitian matrices), each acting on at most fc-qubits each. The task is to determine the largest 
eigenvalue of the sum of these Hamiltonians. This problem generalizes the central NP-hard problem 
MAX-A>CSP, where we are given a set of Boolean constraints on k variables each, with the goal to 
satisfy as many constraints as possible. The local Hamiltonian problem is of significant interest to com- 



plexity theorists and to physicists studying properties of physical systems alike (e.g. BBV05I lAvDK + 07 
IBDOT08I IAALV091 ICV091 ILLM+ 101 \SCW\ ). 

Moving to the classical scenario, the theory of NP-completeness is one of the great success stories 
of classical computational complexity MAB0911 . It was soon realized that many natural optimization 
problems are NP-hard, and are hence unlikely to have polynomial time algorithms. A natural question 
(both in theory and in practice) is to look for polynomial time algorithms that produce solutions that 
are close to optimum. More precisely, one says that an algorithm achieves an approximation ratio 
of c G [0, 1] for a certain maximization problem if on all inputs, the value of the algorithm's output 
is at least c times that of the optimum solution. The closer c is to 1, the better the approximation. 
The investigation of approximation algorithms is, after decades of heavy research, still a very active 
area (e.g., BHoc971IVaz01l ). For many central NP-hard problems, tight polynomial time approximation 
algorithms are known. 

In the context of QMA-complete problems, it is thus natural to search for approximation algorithms 
for these problems, and in particular for the local Hamiltonian problem. The question we address here 
is: How well can one efficiently approximate the k-local Hamiltonian problem? 



*David R. Cheriton School of Computer Science and Institute for Quantum Computing, University of Waterloo, Waterloo 
N2L 3G1, Canada. Supported by the Natural Sciences and Engineering Research Council of Canada (NSERC), NSERC 
Michael Smith Foreign Study Supplement, and by EU-Canada Transatlantic Exchange Partnership programme. 

+ Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel and CNRS & LRI, University of 
Paris-Sud, Orsay, France. Supported by an Individual Research Grant of the Israeli Science Foundation, by European Research 
Council (ERC) Starting Grant QUCO and by the Wolfson Family Charitable Trust. 



1 



It should be noted that a large host of heuristics has been developed in the physics community to 
approximate properties of local Hamiltonian systems (see, e.g., BCV091 for a survey) and this area is ex- 
tremely important in the study of physical systems. However, the systematic complexity theoretic study 
of approximation algorithms for QMA-complete problems is still very much in its infancy, and our work 
is one of the first steps in this research direction. We note that there has been a lot of interest in re- 
cent years HAALV09irAar061 in establishing a so-called quantum PCP theorem (e.g. HAS98llALM + 98t ). 
which amounts to showing that for some constant c < 1 close enough to 1, approximating the fc-local 
Hamiltonian (or related problems) to within c is QMA-hard. Our results can also be seen as a natural 
continuation of that investigation. 



Our results: Let us start by precisely defining the optimization version of the local Hamiltonian prob- 
lem, which is parameterized by two integers k and d, which we always think of as constants. 

Definition 1 (MAX-fc-local Hamiltonian problem on d-level systems (qudits)). An instance of the prob- 
lem consists of a collection of (> ) Hennitian matrices, one for each subset of k qudits. The matrix 
Hii,...,i k corresponding to some 1 < i\ < • • • < \ < n is assumed to act on those quditsrj to be positive 
semidefinite, and to have operator norm at most 1. We call any pure or mixed state p on n qudits an 
assignment and define its value to be TrHp where H = X^- j. Hi lr j k . The goal is to find the largest 
eigenvalue of H (denoted OPT), or equivalently, the maximum value obtained by an assignment. We 
say that an algorithm provides an approximation ratio ofcE [0, 1] if for all instances, it outputs a value 
that is between c ■ OPT and OPT. 

This definition, we believe, is the natural quantum analog of the MAX-fc-CSP problem. We note that 
it differs slightly from the usual definition of the fc-local Hamiltonian problem. Namely, we consider 
maximization (as opposed to minimization), and also restrict the terms of H to be positive semidefinite, 
and have norm at most 1. As long as one considers the exact problem, these assumptions are without 
loss of generality, and do not affect the definition, as seen by simply scaling the Hamiltonians and 
adding multiples of identity as necessary. However, when dealing with the approximation version, these 
assumptions are important for the problem to make sense; for instance, one cannot meaningfully talk 
about approximation ratios if the optimum can take both negative and positive values. That is why we 
require the terms to be positive semidefinite. The requirement that the terms have operator norm at 
most 1 does not affect the problem and later allows us to conveniently define dense instances. Finally, 
changing the maximization to a minimization would lead to an entirely different approximation problem: 
the quantum analogue of MIN-CSP (e.g. BKSTWOlll ). Minimization problems are, generally speaking, 
harder than maximization problems, and we leave this research direction for future work. 

Before stating our results, we remark that there is a trivial way to get a d^-approximation for MAX- 
fc-local Hamiltonian. Observe that the maximally mixed state has at least dr k overlap with the reduced 
density matrix of the optimal assignment on any k particles. A similar thing holds classically, where a 
random assignment gives (in expectation) a d approximation of MAX-fc-CSP. We now describe our 
two main results. 



Approximation by product states. One inherently quantum property of the local Hamiltonian prob- 
lem is the fact that the optimal state might in general be highly entangled (and hence not efficiently 
describable in polynomial time). This is why we do not require outputting the assignment itself in the 
above definition. If, however, the optimal assignment (or some other good assignment) was guaranteed 
to be a tensor product state, then we could describe it efficiently. The following theorem shows just that. 

Theorem 2. For an instance of MAX-k-local Hamiltonian with optimal value OPT, there is a (pure) 
product state assignment that has value at least OPT/ d ^. 

'Terms acting on less than k qudits can be incorporated by tensoring them with the identity. 
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This result is tight for product states in the case of 2-local Hamiltonians (we remark that 2-local 
Hamiltonians are often the most relevant case from a physics perspective). For example, consider the 
Hamiltonian on 2-qubits that projects onto the EPR state ^(|00) + |H)). It is easy to see that no 
product state achieves value more than 1/2. For general k, we can only show that product states cannot 
achieve an approximation ratio greater than l/rf^/ 2 J (see Sec. [3]). 

If we could efficiently find the best product state assignment, we would obtain an algorithm achiev- 
ing a non-trivial d~ k+1 approximation ratio. Unfortunately, this problem is NP-complete, since it would 
allow one to solve (e.g.) the special case of MAX-fc-SAT, and hence we do not have such an algorithm. 
Still, the theorem has the following interesting implication: It shows that unless NP = QMA, approxi- 
mating the local Hamiltonian problem to within a factor less than d~ k+1 is not QMA-hard. This follows 
simply because product states have polynomial size classical descriptions. 

A polynomial time approximation algorithm for dense instances. Our second result gives a clas- 
sical polynomial time approximation algorithm for dense instances of the local Hamiltonian problem. 
This result is perhaps our technically most challenging one, and we hope the techniques we develop 
might turn out useful elsewhere. 

Dense instances of classical constraint satisfaction problems have been studied in depth (see e.g. HdlV961 
IFK961 IGGR981 IAKK99I IdlVKUOl IAdlVKKU21 lBdTVK03l IdlVKKVOSTn . Our result is inspired by work 
of Arora et al. MAKK991 who provide a polynomial time approximation scheme, or PTAS (i.e., an ef- 
ficient 1 — £ approximation algorithm for any fixed e > 0), for several types of dense constraint sat- 
isfaction problems. In the classical case, dense (for 2-local constraints) simply means that the aver- 
age degree in the constraint graph is O(n), or equivalently, that the optimum is Q(n 2 ). In analogy, 
we define an instance of MAX-A>local Hamiltonian to be dense if OPT = Cl(n k ), or equivalently, if 
Tr(H£)=Cl(n k )E 

It is not hard to see that the (exact) dense local Hamiltonian problem remains QMA-hard (see 
Sec. 13 -3b - The dense case might be of practical interest to physicists who study systems of particles 
by incorporating all possible interactions between them. Note that such instances are dense even if the 
interactions between particles are weak, so long as the interaction strengths are constants independent 
of n. Our second main result is the following: 

Theorem 3. For all £ > there is a polynomial time (l/d^ 1 — e) approximation algorithm for the 
dense MAX-k-local Hamiltonian problem over audits. 

Thm. [3] follows immediately by combining Thm. |2] with the following theorem, which gives an 
approximation scheme for the problem of optimizing over the set of product states. 

Theorem 4. For all £ > there is a polynomial time algorithm for dense MAX-k-local Hamiltonian 
that outputs a product state assignment with value within 1 — £ of the value of the best product state 
assignment. 

Proof ideas and new tools: The proofs of Thm. [2] and Thm. |4] are independent and employ different 
techniques. To show the product state approximation guarantee, we show a slightly stronger statement: 
For any assignment | Y) , there is a way to construct a product assignment of at least dr k+1 its value. The 
proof is constructive (given |Y)): we use a type of recursive Schmidt decomposition of |Y) to obtain a 
mixture of product states whose value is off by at most the desired approximation factor (see Sec. [2]). 

Our second result is technically more challenging and introduces a few new ideas to this problem, 
inspired by work of Arora et al. MAKK991 in the classical setting. We illustrate the main ideas for MAX- 
2-local Hamiltonian on n qubits. Recall that our goal is to find a PTAS for the local Hamiltonian problem 

2 The equivalence follows from the fact that the mixed state assignment Id/d n has value between OPT and OPT/ d . 
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over product states. The value of the optimal product state assignment, OPTp, can be written 

n 

OPTp = max £ Tr(H irj (pj <g> pj)) s.t. p, ^ and Tr(p ( ) = 1 for 1 < i < n, (1) 

i=l j<EN(i) 

where N(i) is the set of indices j for which a local Hamiltonian term Hn is present. We might call this 
a quadratic semidefinite program, as the maximization is quadratic in the p, (and as such not efficiently 
solvable). Note, however, that if the terms in the maximization were linear, then we would obtain a 
semidefinite program (SDP), which is efficiently solvable BGLS93L To "linearize" our optimization, we 
use the "exhaustive sampling method" developed by Arora et al. MAKK991 (a method which was later 
key in many developments in property testing, e.g. MGGR98I1 ). We write each Hamiltonian term in a 
basis that separates its two qubits, for instance the Pauli basis {<Tq, <J\, C3}, H; ; = Y% i=o K kWk ® 
For i = 1, . . . , n and k = 0, 1, 2, 3, define 

4 : = E E4 r K^;)- 

jGN(i) I 

If we knew the values of ci for the optimal pi, then solving the SDP below would yield the optimal pf. 

n 3 

max E c k^ r (°kPi) s -t- Pi ^ an d Tr(p,-) = 1 for 1 < i < n, (2) 

!=1 k=0 

E E4 ; ; Tr (^i°/) =4 for 1 < z < n and < /c < 3. 

jeN(i) l 

Of course, this reasoning is circular, as in order to obtain the ci we need the optimal pi. The crucial idea 
is now to use sampling to estimate the ci. More precisely, assume for a second that we could sample 
O(logn) of the p, randomly from the optimal assignment. Then, by standard sampling bounds, with 
high probability over the choice of the sampled qubits we can estimate all the ci to within an additive 
error ±en for some e. If we had these estimates ai for the ci, we could solve the SDP above with the 

slight modification that the last constraint should be a' k — en < £]y e wi) YLl °^i^ r {piPj) — a \ + m - With 
high probability over the sampled qubits, this SDP will give a solution that is within an additive en 2 of 
the optimal one (more subtle technicalities and all calculations can be found in Sec. [3]). Moreover, it is 
possible to derandomize the sampling procedure to obtain a deterministic algorithm (Sec. l3.3t i. 

Of course, we are still in the realm of wishful thinking, because in order to sample from the optimal 
solution, we would need to know it, which is precisely what we set out to do. However, the number of 
qubits we wish to sample is only logarithmic in the input size. Thus, to simulate the sampling procedure, 
we can pick a random subset of O(logn) qubits, and simply iterate through all possible assignments 
on them (with an appropriate <5-net over the density matrices, which incurs a small additional error) in 
polynomial time! Our algorithm then runs the SDP for each iteration, and we are guaranteed that at least 
one iteration will return a solution within en 2 of the optimal one. Because the denseness assumption 
guarantees that OPTp is Q(n 2 ), our additive approximation turns into a factor (1 — e) -approximation, 
as desired. All details, the runtime of the algorithm and error bounds for the general /c-local case on 
qudits are given in Sec. [3] 



Previous and related work: We note that many heuristics have been developed in the physics com- 
munity to approximate properties of local Hamiltonian systems and this area is extremely important 
in the study of physical systems (e.g. HWhi921 IWhi931 IOR951 IRQ971 ISch05l IPWKE981 ICV09II ). Our 
focus here is, however, on rigorous bounds on the approximation guarantee of algorithms for the gen- 
eral problem. In this area, to our knowledge, few results are known. In a first result on polynomial 
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time approximation algorithms, Bansal, Bravyi and Terhal [ BBT09I give a PTAS for a special class of 
the local Hamiltonian problem, so called quantum Ising spin glasses, for the case where the instance is 
on a planar graph (and in particular of bounded degree). This PTAS is obtained by dividing the graph 
into constant size chunks, which can be solved directly, and ignoring the constraints between chunks 
(this incurs an error proportional to the number of such constraints, which is small because the graph is 
planar). More recently, there has been work proving rigorous approximations to ground states of one- 
dimensional quantum systems under well-defined conditions using techniques such as density matrix 
renormalization group MAAHOirSClOi To our knowledge, we are the first to establish a bound on the 
approximation factor by optimizing over the set of product states. 

Discussion and open questions: Our two results give approximations to the local Hamiltonian prob- 
lem. Although at first glance, our approximation ratio of l/d^ 1 may appear an incremental improve- 
ment over the trivial random assignment strategy, there are two important notes that should be kept 
in mind: The first is that many classical NP-hard problems, such as MAX-3-SAT (a special case of 
MAX-fc-CSP where each constraint is the disjunction ("OR") of k variables or their negation), are ap- 
proximation resistant (e.g. BHas07l IAM08I1 ). meaning that unless P=NP, there do not even exist non- 
trivial approximation ratios beyond the random assignment strategy. For example, for MAX-3-SAT it 
is NP-hard to do better than the approximation ratio of 7/8 achieved by random assignment ||Has97l . 
Thus, showing the existence of a non-trivial approximation ratio is typically a big step in the classical 
setting. Moreover, it could have been conceivable that for MAX-fc-local Hamiltonian, analogously to 
MAX-3-SAT, outperforming the random assignment strategy would have been QMA-hard. Yet our re- 
sults show that unless NP=QMA, this is not the case. The second important note that should be kept in 
mind is that the currently best approximation algorithm for MAX-fc-CSP gives an approximation ratio 
of only about 0Mk/2 k for k > 2 HCMM071 (for k = 2, one can achieve 0.874 HLLZ02II ) and this 
is, moreover, essentially the best possible under a plausible complexity theoretic conjecture (namely, 
the unique games conjecture BKho02ll ) MTre981 lHas05l IST061 IAM081 This is to be contrasted with our 
2/2 ,: -approximation ratio for the case of d = 2 (i.e. qubit systems), which we show can be achieved by 
product state assignments for arbitrary (i.e. even non-dense) MAX-/c-local Hamiltonian instances. This 
raises the important open question: is our approximation ratio tight? 

Our product state approximation shows that approximating the local Hamiltonian problem to within 
rf-k+i j s jjj jsjp j t wou id be interesting to know if this approximation ratio could also be achieved in 
polynomial time. If not, it might lead to an intriguing state of affairs where for low approximation ratios 
the problem is efficiently solvable, for medium ratios it is in NP but not efficiently solvable, and for high 
ratios it is QMA-hard (assuming a quantum PCP theorem exists). 

To obtain our results for the case of dense local Hamiltonians, we have introduced the exhaustive 
sampling technique of Arora et al. flAKK99l to the setting of low-degree semidefinite programs. We 
linearize such programs using exhaustive sampling in combination with a careful analysis of the error 
coming from working with <5-nets on density matrices. We remark that it seems we cannot simply 
apply the results of MAKK991 for smooth Polynomial Integer Programs as a black-box to our setting. 
This is due to our aforementioned need for a ^-net, as well as the requirement that our assignment be 
a positive semidefinite operator. We address the latter issue by extending the techniques of HAKK991 
to the realm of positive semidefinite programs by introducing the notion of "degree-fc inner products" 
over Hermitian operators to generalize the concept of degree-A: polynomials over real numbers, and 
performing the more complex analysis that ensues. We hope that this technique will be of much wider 
applicability, particularly considering the growing use of semidefinite programs in numerous areas of 
quantum computing and information (e.g. IIDPS041lJJUWT0llLMRST0l ). 

Another open question is whether similar ideas can be used to approximate other QMA-complete 
problems, such as the consistency problem HLiu06i Moreover, can we obtain polynomial time algo- 
rithms without the denseness assumption? And are there special cases of the local Hamiltonian problem 
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for which there is a PTAS (other than for planar Ising spin glasses MBBT09I )? Of course, we do not 
expect a PTAS for all instances of the local Hamiltonian problem, as this would contradict known hard- 
ness results for special classical cases of the problem. However, perhaps there exist other classes of 
physically relevant instances of the problem for which a PTAS does exist. 

Structure of this paper: In Sec. 12 we prove our result on product state approximations (Thm. [9] 
and the ensuing proof of Thm. [2]), show its tightness in the 2-local case and provide the upper bound 
of dr for the best possible approximation by product states. Sec. [3] gives our polynomial time 
approximation algorithm and develops the general sampling and SDP-based technique we use. It also 
shows that the dense local Hamiltonian problem remains QMA-complete. 

Notation: We use A >z to say operator A is positive semidefinite, and denote by L(X), H{X), 
and D(X) the sets of linear, Hermitian, and density operators acting on complex Euclidean space X, 
respectively. We denote the Frobenius and operator norms of A £ L(X) as || A || F = y / Tr(A + A) and 
II A L =maX| x)e ^ s . t || z || 2=1 || A\x) || 2 , respectively. 

2 Product states yield a 1 / d k ~ l -approximation for qudits 

We now show that product state assignments achieve a non-trivial approximation ratio for MAX-fc-local 
Hamiltonian , i.e. Thm|2] The heart of our approach is what we call the Mixing Lemma (Lem.[7]), which 
we use to prove Thm. [9] Thm. [2] will then easily follow. At the end of the section, we discuss the 
tightness of the approximation guarantee given by Thm. |2] We begin with two definitions. 

Definition 5 (Recursive Schmidt Decomposition (RSD)). We define the recursive Schmidt decomposi- 
tion of a state £ [<C d )® n as the expression obtained by recursively applying the Schmidt decompo- 
sition on each qudit from 1 to n — 1 inclusivd^. For example, the RSD for 3-qubit \ip) is 

\tp) = «i|ai) ® (fr|&i)|ci) +p 2 \h)\c 2 )) +cc 2 \a 2 ) Wi)|c'i) + p' 2 \b' 2 )\c' 2 )), 

for a \ + a| = fi\ + jS| = fi'i + ft' 2 = 1, {| fl /)},- an orthonormal basis for qubit 1, {!&;)},■ and {\b'i)}i 
orthonormal bases for qubit 2, and { |c,) } ( and { |c' ; ) } ; - orthonormal bases for qubit 3. 

Definition 6 (Schmidt cut). Forany|i/>) G (C d )®" with Schmidt decomposition \xp) =YLi=i 0L i\ w i)\ v i)' 
where \iVj) G C d and \v{) G (C rf ) 0f!_1 , and for any \(p) G (C rf ) 0m , we refer to the expansion \(p) ® 

(j2i=i a i\ w i) \ v i)^j as tne Schmidt cut at qudit m + 1. We say that a projector TI crosses this Schmidt 
cut if FI acts on qudit m + 1 and at least one qudit i G {m + 2, . . . ,m + n}. 

The heart of our approach is the following Mixing Lemma, which provides, for any assignment 
\tp) G (C d ) m , an explicit construction through which the entanglement across the first Schmidt cut of 
\ip) can be eliminated, while maintaining at least a (1 /^-approximation ratio relative to the value \tp) 
achieves against any local Hamiltonian H G H((C d )®"). 

Lemma 7 (Mixing Lemma). Given state \xp) on n qudits with Schmidt cut on qudit 1 given by = 
Y4=\0Ci\wi)\vi), where \wi) G C d and \vi) G (C rf )®" _1 , define p := ^ =1 txf\Wi)(iVi\ (g> \vi)(Vi\. 
Then, given projector IT acting on some subset S of the qudits, if IT crosses the Schmidt cut, then 
Tr(np) > ±Tr(IT | !/>)()/> |). Otherwise, Tr(Eip) = Tr(n|^)(^|). 

3 This definition is relative to some fixed ordering of the qudits. The specific choice of ordering is unimportant in our 
scenario, as any decomposition output by such a process suffices to prove Thm. [2] 
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Proof. Case 2 follows easily by noting that the given Schmidt decomposition of \tp) implies Tri(p) = 
Tri(|i/?) (xp\) and Tr2,..., n (p) = T^2,...,n(\ip) (Vl)- To prove case 1, we observe by straightforward expan- 
sion that 

7r(Tl\ip)(xp\) = Tr(TIp) +E«i*;( w <K t '»l n l« 7 />l 1 ';) +a,-ay(wy|(o / |n|a; I -)|o i ). (3) 
Then, by defining for each i vector |a,) := OiiTI\wj) \vi), we have that 

Y^CtiUj{Wi\{Vi\Il\Wj)\Vj) +KiCtj{lVj\{Vj\II\Wi)\Vi) = Y.^Mj) + ( a j\ a i)- 

i<j i<j 
Applying the facts that IT 2 = n and (a\b) + (b\a) < \\ \a) \\\ + || \b) \\\ for \a), \b) G {C d ) m implies 
J^<«i|ay) + (fl ; -|flj) < J] || |flf) H2+ I) |fl;) H2 = {d-\)Y d x 2 i {wi\{vi\U\wi)\v i ) = (d-l)Tr(np), 

i<j i<j i 

from which the claim follows. □ 

The following simple extension of Lem. simplifies our proof of Thm.[9] 

Corollary 8. Define \ip') := \<p) <g> \ip), where \(p) G (C d ) 0m for m > and \xp) is defined as in 
Lem. [7] and let p G D(C d )® n be obtained from \ip) as in Lent. [7J Then, for any projector IT acting on a 
subset S of the audits, ifW crosses the Schmidt cut of\ip') at audit m + 1, we have Tr(IT|<^) (<p\ <g> p) > 
iTr(n|i/>')(i//|). Otherwise, 1r(Jl\(p) {<p\ ® p) = Tr(IT|t//) (i//|). 

Proof. Immediate by applying the proof of Lem.|7]with the following modifications: (1) Define |o,-) := 
a,-n|0) \wj) \vj), and (2) if S C {1, . . . ,m} U {m + 2, . . . ,m + n} (i.e. this is one of two ways for IT 
not to cross the cut — the other way is for S C {1, . . . , m + 1}), observe that by the same arguments as 
in Lem.[7]for case 2 and the product structure between \<p) and \xp) in \xp') that Tr m+ i(\(p) (<p\ ® p) = 
Tr w+1 (|t//)(t//|). □ 

Lemma |7] shows that the state p obtained by mixing the d Schmidt vectors of \ip), as opposed to 
taking their superposition, suffices to achieve a (1/d) -approximation across the first Schmidt cut. By 
iterating this argument over all n — 1 Schmidt cuts, we now prove that a mixture of all (product) states 
appearing in the RSD of \tp) achieves an approximation ratio of l/d fc_1 . 

Theorem 9. For any n-qudit assignment \ip) with RSD \xp) = Y^=\ s/Vi\ < Pi)> where Y^iPi = 1 an d 

{|</>i)};=i is a set of orthonormal product vectors in (C rf )®", define p := Y?i=i Vi\ ( Pi){ ( Pi\- Then, for 
any projector IT acting on some subset S C {1, . . . ,n} of audits with \S\ = k, we have Tr(ITp) > 

^TrWMI). 

Proof. Let IT be a projector with |<S| = k, and define c G {0, 1}" 1 such that c(j) = 1 iff IT crosses 
the Schmidt cut at qudit j. For example, if IT acts on qudits {1, 2}, then c = (1, 0, . . . , 0). Note that in 
general || c Hj = k — 1. We proceed by iteratively stepping through each Schmidt cut in the RSD of \xp). 
Let p(°) := \ip) {xp\, and consider first the cut at qudit 1, i.e. \tp) = Yd=\ a i\ w i) f° r \ w i) S C d and 
\Vi) G (C d ) m - 1 . Defining pM := £f =1 a?K)(Wi| <g> |p;)(p,-|, we have by Lem. [7] that 

Tr(n|v><vl) < <fWTr(npW), (4) 

i.e. we lose a factor of 1 iff IT crosses the first cut. 

Moving on to the second Schmidt cut, consider the state |h?i)|z?i) G C d (£> (C^) " -1 appearing 
in the expression for pW. Observe that it satisfies the preconditions for Cor. [8] with m = 1. Hence, 
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via Cor. [8] there exists a state d\ acting on qudits {2, . . . ,n} such that Tr(n|z0i)(u>i| <8> |z>i)(z>i|) < 
d c ( 2 'Tr(n|wi)(wi| (g) Pi). We can analogously find states a, corresponding to \wi)\v{) for all 1 < i < 
d. Thus, 



Tr(ripW) 



Y^ot?Ti(n.\wi)(Wi 

i=\ 



\Vi){Vi\)<d<V 



Y^OifTr (Yl\wi){wi\ ® (7-j) 
i=i 



(5) 



Hence, by defining p^> := Yd=\ \ w i) { w i\ ® &u we have y i a Eqns. © and ([5]) that 

Tr(n|V>(^|) < d c(1)+c(2) Tr(lV 2 )). 

Since by Cor. [8j the cj, are mixtures of Schmidt vectors from the second Schmidt cut, we can now 
iteratively apply the same procedure to the (at most d 1 ) pure states appearing in the expression for p^ 
when considering the third Schmidt cut. Note in particular that each of these terms will have a product 
structure between qudits {1, 2} and {3, . . . , n}, as required by Cor. [8]for the next iteration. 

More generally, when considering the pth Schmidt cut, we apply Cor. [8] with m = p — 1 to each of 
the at most terms appearing in the expansion of p(P~ l \ We continue iterating in this fashion until 
we have exhausted all n — 1 Schmidt cuts, at which point the resulting mixture p( n ~*^ we are left with 
is in fact the p from the statement of the claim (seen by noting that our procedure essentially iteratively 
computes the RSD of \xp), mixing the Schmidt vectors it computes at each step). Moreover, due to the 
repeated application of Cor. [8j we have 



7r(Tl\ip)(xp\) < rfH c lliTr(np^-^). 



(6) 



Recalling that || c |L = k — 1 completes the proof. 



□ 



Proof of Thm. |2j Simply apply Thm. [9] to each projector in the spectral decompositions of each 
(positive semidefinite) H, in our MAX-fc-local Hamiltonian instance H = £;H;, and let \tp) denote 
the optimal assignment for H. It is important to note that we can exploit Thm. [9] in this fashion due to 
the fact that the p constructed by Thm.|9]is independent of the projector n — i.e. for any fixed \xp) and 
k, the state p provides the same approximation ratio against any fc-local projector Yl encountered in the 
spectral decompositions of the H, . Finally, note that one can find a pure product state achieving this 
approximation guarantee since p is a convex mixture of pure product states. 

Upper bound of d~~ W for product state approximations. Is the result of Thm. [2]tight? In the case 
of MAX-2-local Hamiltonian on qudits, yes — consider a single clause projecting onto the maximally 
entangled state YLi |m)> f° r which a product state achieves value at most 1/d. On the other hand, for 
MAX-3-local Hamiltonian on qubits, the worst case clause for a 3-qubit product state assignment is the 
projector onto the state \W) = -75 (| 001) + |010) + 1 100) ) HTWP091 But here product states achieve 
value 4/9 HWG03II . implying the bound of 1/4 from Thm.|2]is not tight. 

A simple construction shows that the true optimal ratio is upper bounded by dr L2 J . To see this, con- 
sider a single clause which is the tensor product of maximally entangled bipartite stated). For example, 
for n = 4, consider the clause |</> + )(</> + | ® |</> + ) (</> + 1, where |^ + ) = 4g(|00) + |11)). The maximum 
value a product state can attain is 1/4, as claimed. In the qubit setting (d = 2), one can further improve 
this construction for odd k by replacing the term \(p + ) ((p + \ ® I on the last three qubits with |W) (W\. 
For example, for k = 5, setting our instance to be the clause \(p + ) ((p + 1 <8> | W) (W| yields an upper bound 
of (1/2) (4/9) = 2/9 < 1/4 = cHzJ (where we again use the value 4/9 for \ W) from the previous 

—k+7 

paragraph). For general odd k > 1, this improved bound generalizes to 2~ t~ /9. 
4 For odd k, we assume the odd qudit out projects onto the identity. 
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3 Optimizing over the set of separable quantum states 



Section [2] showed that there always exists a product state assignment achieving a certain non-trivial 
approximation ratio. In this section, we show how to efficiently find such a product state. Our main 
theorem of this section is the following (Thm.fTOl). from which Thm. [4] follows easily (see discussion at 
end of Sec. [33). 

Theorem 10. Let H be an instance of MAX-k-local Hamiltonian acting on n qudits, and let OPTp 
denote the optimum value ofTr(Hp) over all product states p G D((C d )® n ). Then, for any fixed e > 0, 
there exists a polynomial time (deterministic) algorithm which outputs p\ ® • • • ® pn G D((C d )® n ) 



such that Tr(Hpi 



>p n ) > OPTp -en* 



We first outline our approach by generalizing the discussion in Sec.Q] introducing tools and notation 
we will require along the way. The optimal value OPTp over product state assignments for any MAX- 
A:-local Hamiltonian instance can be expressed as the following program, denoted Pi : 



OPTp = max £ Tr(H ;i ik p h 



<pi k ) s.t. pi h and Tr(p f ) = 1 for 1 < / < n. (7) 



As done in Eqn. ©, we now recursively decompose our objective function as a sequence of nested sums. 
Let {(7/}f =1 be a traceless, Hermitian orthogonal basis for the set of Hermitian operators acting on C d , 
such that Tr (c^-cy) = 25 ij (for 5{j the Kroenecker delta) (see, e.g. BKim0310 . Then, by rewriting each 

Hii,...,i k in terms of {<7j} d =1 , our objective function becomes 



E Tr 



i k ,...,H 




r h jklk 



E Tr Ka) 

k-ik 



, E T < a h-ipik-i) 




, (8) 



where each r h '-' tk G ]R rf2 . We henceforth think of the objective function above as a "degree-A: inner 
product", i.e. as a sequence of k nested sums involving inner products, in analogy to the degree-k 
polynomials of Ref. MAKK99L In this sense, a degree- 1 inner product would refer to only the innermost 
sums over i\ and j\, and a degree-A; inner product would denote the entire expression in Eqn. ([8]). More 
formally, we denotd^la degree-b inner product for 1 < b < k using map fj, : H(C d ) xn h-> IR, defined 



suchthaE*&(pi,.../Pn) := Li b ,j b Ma jb p lb ) ■ ■ ^ lr {[Lj^Zffi) (H^ 

Our approach is to "linearize" the objective function of Pi using exhaustive sampling and recursion 
to estimate its degree- (k — 1) inner products. To do so, we will require the Sampling Lemma. 



Lemma 11 (Sampling Lemma MAKK 991). Let (fl ; ) be a sequence ofn real numbers with |a, | < Mfor 
all i, and let f>0.Ifwe choose a multiset ofs = gTogn of the a, at random (with replacement), then 

i-f. 



their sum q satisfies J^j a, — nhA\ ^ < < Yd a i + nM\j ^ with probability at least 1 



The proof of Lemma [TT1 follows from a simple application of the Hoffding bound [Hof64]. To use 
the Sampling Lemma in conjunction with exhaustive sampling, we will discretize the space of 1-qudit 
density operators using a ^-net G C H(C d ), such that for all p G D(C rf ), there exists a G G such that 
II P — cr IL < 5. We now show how to construct G. 



5 See the beginning of App.fA]for more elaborate notation used in the proofs of the claims of Sec. [3] 
6 Note that f;, implicitly depends on parameters ■ ■ ■ Ak an d 764 



hi' 



,1k- 
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To obtain G, we instead construct a <5-net for a subset of H(C d ) which contains D(C d ), namely the 
se|l|.A(C d ) := {A G H(C d ) | max^ \ A(i,j) \ < l}. Creating a S-net over A(C d ) is simple: we cast a 
(S/d) -net over the unit disk for each of the complex d(d — 1) /2 matrix entries above the diagonal, and 
likewise over [—1, 1] for the entries on the diagonal. Letting m and n denote the minimum number of 
points required to create such (<S/d)-nets for each of the diagonal and off-diagonal entries, respectively, 

we have that |G| = m z n d . For example, simple nets of size m d/S and n d 2 /S 2 can be 
obtained by placing a ID and 2D grid over [—1, 1] and the length 2 square in the complex plane centered 
at (0,0), respectively, implying |G| G O(l) when d G 0(1). To show that G is indeed a ^-net, we now 
bound the FrobeniuCl distance between arbitrary p G D(C rf ) and the closest p G G. Specifically, let 
A := p - p. Then: 

|| A || F = \Jlr(A f A) = ^£ |A( W ')| 2 < ^_>/^) 2 = £(<0 = *■ 

Finally, we remark that our dense assumption on MAX-A:-local Hamiltonian instances is only nec- 
essary to convert the absolute error of Thm. [10] to a relative one BGK1II (this conversion is detailed in 
Sec. 13- 3b - The remaining sections are organized as follows: In Sec. 13- 1 L we show how to recursively 
estimate degree-^ inner products using the Sampling Lemma. We then use this estimation technique in 
Sec. l3.2l to linearize our optimization problem Pi. Sec. l3.3l brings everything together by presenting and 
analyzing the complete approximation algorithm. To ease reading of the remaining sections, all techni- 
cal proofs are found in App.lAl Please see the beginning of App.lAlfor definitions of the more elaborate 
notation used in these proofs. 



3.1 Estimating degree-fc inner products using the Sampling Lemma 

Our recursive procedure, EVAL, for estimating a degree-/: inner product using the Sampling Lemma is 
stated as Alg. [T2j There are two sources of error we must analyze: the Sampling Lemma, and our S-net 
over C d . We claim that EVAL estimates the degree-fo inner product t\,(p\, . . . ,p n ) to within additive 
error ±e\,n° , where is defined as follows. Set A := \/2d(l + <5), for 5 from our 5-net. Then, 




The following lemma formalizes this claim. We adopt the convention of MAKK991 and let x G y ± z 
denote x G [y,z]. Alg.[l2]is our operator analogue of the algorithm Eval in Section 3.3 of MAKK991 . 

Lemma 13. Let t k : H(C ,C ) X " i-> K be defined using set {H k/ „ Jk } C H((C d ) m ) (as in Eqn. ©). Let 
S C { 1, . . . , n} such that \S\ = g log n have its elements chosen uniformly at random with replacement. 
Let p\, . . . ,p n G D(C d ) be some assignment on all n audits, and {pi : i G S} a set of elements in our 5- 
net such that || p,- — pi |L < 5 for all i G S. Then, for 1 < b < k, with probability at least 1 — d 2b n b ~f, 
we have EVAL(^, S, {pj : i G S}) G f;,(pi, . . . ,p n ) ± £b nh , where is defined as in Eqn. ([9]). 



3.2 Linearizing our optimization problem 

Our procedure, LINEARIZE, for "linearizing" the objective function of Pi using EVAL from Sec. 13. 1 l is 
stated as Alg. [141 Alg. [141 takes as input Pi and a set of sample points {pi}, and outputs a semidefmite 
program (SDP) which we shall henceforth refer to as P2. We remark that LINEARIZE is our version of 

7 Note: A net over A(C^) may allow non-positive assignments for a qudit. See Sec. l3.3l for why this is of no consequence. 
8 We use the Frobenius norm as it allows a simple analysis. It is straightforward, however, to switch to say the trace norm 
using the fact that || X || F < \fd \\ X || tr for all X 6 C rf . 
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Algorithm 12. EVAL( t h , S , {pi : i E S} ). 

• Input: (1) A degree-b inner product t b : H(C d ) y " R for 1 < b < k 

(2) A subset S C {1, . . .,n} of size \S\ = O(logn) 

(3) Sample points {pj : i G S} such that || — || F < 5 for all j G S 

• Output: x G R such that x G f&(/0i, . . . ,p n ) ± €b nh (f° r £fa defined in Eqn. ©). 

1. For all i G S and ; = 1 . . . d 2 : 

(a) (Base Case) if b = 1, set e, ; = 1. 

(b) (Recurse) else, set = EVAL(^ _ 17 S, {pj : i 6 S}). 

2. Return ^E,. es fcf i Tr^-ft)^ . 



the procedure Linearize in Sec. 3.4 of BAKK991 , extended to the setting of operators and a more complex 
error structure. Although LINEARIZE is presented as linearizing an objective function here, the same 
techniques straightforwardly apply in linearizing constraints involving high-degree inner products. 

To prove correctness of our final approximation algorithm, we require the following two important 
lemmas regarding Pi- The first shows that any feasible solution (pi, . . . , p„) for Pi consistent with the 
sample set {p, : i G S} fed into LINEARIZE is also a feasible solution for P2 with high probability. 

Lemma 15. Let t^, assignment {p\, . . . ,p n ), S, and {pi : i G S} be defined as in Lem. \13\ Then, for any 
f,g>0, calling LINEARIZE with parameters tjc {f>i : i G S}, and e = (for defined in Eqn. ([9])) 
yields an SDP P2 for which the assignment {pi, . . . ,p n } is feasible with probability at least 1 — d 2k n k ~f. 

The second lemma is a bound on how far the optimal solution of P2 is from the optimal solution for 
Pi . We adopt the convention of BAKK991 and write [x, y] ± z to denote interval [x — z, y + z] . 

Lemma 16. Let OPTp be the optimal value for P\ with corresponding assignment p OFJ p ;— 



and let {pj : i G S} be such that 



Pi 



opt 



< 5 for alii G S for some S C {1,, 



Ol >■■■> P'n 
,n}. Let P2 denote 



the SDP obtained by calling LINEARIZE with S, and denote by £ m for 1 < m < k the error parameter 
passed with map t m into a (possibly recursive) call to LINEARIZE. Then, letting OPT2 denote the opti- 
mal value 0/P2, we have with probability at least 1 — d 2k n k ~f (for parameters set as in Lem. 1771 ) that 
OPT 2 G OPTp ± d{d + y/2) Em= 1 i(^) fc - 1 - m e 



3.3 The final algorithm 

We finally present our approximation algorithm, APPROXIMATE (Alg. [TTT ). in its entirety, which ex- 
ploits our ability to linearize Pi using LINEARIZE (Alg. [T4l. This proves Thm[lOl which in turn im- 
plies Thm. |U We first clarify a few points about APPROXIMATE, then analyze its runtime, and follow 
with further discussion, including the algorithm's derandomization and a proof that dense MAX-fc-local 
Hamiltonian remains QMA-hard. 

We begin by explaining the rationale behind the constants in Alg. [T7] The constant e s d p is the 
additive error incurred when solving an SDP [ GLS 93j We choose e' so that after running LINEARIZE 
and solving P' 2 , the total additive error is at most e, as desired. We choose / to ensure the probability of 
success is at least 1/2. Finally, we set g large enough and 5 (for our <5-net) small enough to ensure that 
e' matches the error bounds for EVAL in Lem. [13] 
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Algorithm 14. LINEARIZE( t b ,Af , S, {pi : i G S}, e, U, L ). 

• Input: (1) A degree-fc inner product t b : H(C d ) x " H> R for 1 < b < k. 

(2) A set of linear constraints AT (e.g. "p, ^ 0"). 

(3) A subset S C {1, . . . ,n} of size |S| = O(logn). 

(4) Sample points {p, : G S} consistent with some feasible solution (p\, . ■ ■ ,p n ) for 
Pi such that || pi — pi || F < 8 for all i G S. 

(5) An error parameter e > 0. 

(6) (Optional) upper and lower bounds U, L G R. If U and L are not provided, we 

assume U,L = oo. 

• Output: (1) (Optional) A linear objective function / : (L(C rf )) x " -> R. 

(2) An updated set of linear constraints, Af. 

1. (Base case) If b = 1, then 

(a) (Trivial: Initial objective function was linear) If U = L = oo, return [tj,, Af]. 

(b) (Reached bottom of recursion) Else, return [Af U {"L < t\,{p\, ■ ■ ■ ,p n ) < U"}]. 

2. (Recursive case) For i = 1 . . . n and j = 1 . . . d 2 do 

(a) Set e !y := EVAL(^_ 1 _, S, {p t : i G S}). 

(b) Set e' := e - dS (J { + 5) A 6-1 , for A defined in Eqn. ©. 



(c) Set Ijj := e i; - - e'n fc_1 and u !; - := + e'n b ^ 1 . 



(d) Call LINEARIZE(^_ v Af, S,{pi : i G S},e', u, 7 , / !; ). 
3. (a) (Entire computation done) If U = L = oo, return J^ij ^{°'iPi) e ij,-^' 



(b) (Recursive call done) Else, return 



AfU {"L-e'd 2 n b < Ei/Tr^)^ < U + e'dV"} 



We now analyze the runtime of Alg. [T7J Let |G| denote the size of our £-net G for a qudit. Then, 
for each of the |G|^ los " iterations of line 6, we first take 0(n fc_1 ) time to run LINEARIZE, out- 
putting 0{n k ~ l ) new linear constraints (seen via a simple inductive argument). We then solve SDP 
P l 2 , which can be done in time polynomial in n and log(l/e s dp) using the ellipsoid method MGLS931 
(see, e.g., BWat091 ). Let r(n, e s dp) denote the maximum runtime required to solve any of the P l 2 . 
Then, the overall runtime for Alg. [171 is 0(n^ log l G l {n k ~ l + r( n ' e sdp)))> which is polynomial in n for 
e,d,k G O(l) (recall from Sec. [3]that |G| G 0((|) rf ), and that 8 and g are constant in our setting). Note 
that, due to the implicit dependence of g on e, this runtime scales at least exponentially with varying e. 

Before moving to further discussion, we make two remarks. First, one can convert the output of 
Alg.QjJto a pure state with the same guarantee by adapting the standard classical method of conditional 
expectations MVazOll . To demonstrate, suppose {p{\ is output by Alg. [FT] Then, set p[ to be the 
eigenvector \tpj){tfj\ of p\ for which the assignment \ipj){ipj \ <8> pi <8> • • ■ <8> p n performs bes|^| for P\. 
Let our new assignment be p 1 ^ ® p2 ® • • • ® p n - Now repeat for each pi for 2 < /' < n. The final state 
p[ ® • • • ® p' n is pure, and by convexity is guaranteed to perform as well as p\ ® • • • <S> p n ■ 

Second, recall from Sec.[3]that we constructed a £-net over a space larger than D(C rf ), allowing pos- 
sibly non-positive assignments for a qudit. We now see that this is of no consequence, since regardless of 
which samples (positive or not) we use to derive our estimates with the Sampling Lemma, any feasible 
solution to P 2 in Alg. Qjjis a valid assignment for Pj. Moreover, we know that for each optimal p, for 

9 If the spectrum of p, is degenerate, begin by fixing an arbitrary choice of spectral decomposition for p,-. 
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Algorithm 17. APPROXIMATE^ H , e). 



• Input: (I) A k-local Hamiltonian H = £4 t ,...,i k Hh,...,i k f or each Hi t/ _j k G H((C d ) k ). 

(2) An error parameter e > 0. 

• Output: A product assignment p\ ® • • • ® p n that with probability at least 1/2, has value at least 

OPTp — en k ,for OPTp the optimal value for H over all product state assignments. 

1. Set e sdp := e/10. 

2. Define h : R — > R such that for any error parameter e input to LINEARIZE, h(e)n k is the 
absolute value of the bound on additive error given by hem. [76] Then, define e' implicitly so that 
h(e') + e sc jp = £ holds. 

3. Define constant f such that 1 — d lk n k ~f > 1/2. 

4. Define constants g and 5 implicitly so that e' = ^y^| + $\ ^ ) > / or A defined in Eqn. ([£]). 

5. Choose g log n indices S C {1, . . . , n} independently and uniformly at random. 

6. For each possible assignment ifrom our 5-net to the qudits in S: 

(a) Call LINEARIZE (tk, {P 1 's constraints}, S, i, e') to obtain SDP V\. 

(b) Let tti denote the value ofP\ obtained by substituting in the optimal solution ofP\. 

7. Return the assignment corresponding to the maximum over all a,-. 



Pi, there must be some operator (positive or not) within distance 8 in our net, ensuring our estimates 
obtained using the Sampling Lemma are within our error bounds. 

Converting the absolute error of Algorithm [17] into relative error. To convert the absolute error 
±en k of Alg. [Tjjinto a relative error of 1 — e' for any e' , define constant c such that cn k is the value 
obtained for a MAX-A>local Hamiltonian instance by choosing the maximally mixed assignment I/d n 
(analogous to a classical random assignment). Since I / d n can be written as a mixture of computational 
basis states, we have OPTp > cn k . It follows that by setting e = ce', Alg. [TT] returns an assignment 
with value at least OPT P - ce'n k > OPT P - e'OPT P > OPT P (l - e'), as desired. 

Derandomizing Algorithm [17] The source of randomness in our algorithm is Lem. [TT] By a stan- 
dard argument in MAKK99B (see also HBR941 IBGG93II ). this randomness can be eliminated with only 
polynomial overhead. Specifically, we replace the random selection of g log n indices in the Sampling 
Lemma with the set of indices encountered on a random walk of length O(glogn) along a constant 
degree expander HGH93H . Since the expander has constant degree, we can efficiently deterministically 
iterate through all n°^s) such walks, and since such a walk works with probability 1 / n°^\ at least one 
walk will work for all poly(n) sampling experiments we wish to run. 

QMA-hardness of dense MAX-fc-local Hamiltonian. It is easy to see that (exact) MAX-2-local 
Hamiltonian remains QMA-hard for dense instances (a similar statement holds for MAX-2-S AT MAKK99II ) 
For any MAX-2-local Hamiltonian instance with optimal value OPT, we simply add n qudits, between 
any two of which we place the constraint 1 00) (00 1 (no constraints are necessary between old and new qu- 
dits). Then, the new Hamiltonian has optimal value OPT + (~), making it dense, and the ability to solve 
this new instance implies the ability to solve the original one. The argument extends straightforwardly 
to MAX-fc-local Hamiltonian for k > 2. 
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use the Sampling Lemma to estimate the inner sum over neighbours N(i) of i, we might 
run into a problem if i does not have 0(n) neighbours. To circumvent this [AKK99], ob- 
serve that Lem.fTTIonly gives us an estimate to within ±en. Thus, if N(i) < en/ 10 (say), 
then we do not use the Sampling Lemma, but rather let our estimate be simply 0, which 
is guaranteed to fall within the desired error bounds (observe an estimate of does not 
necessarily work, on the other hand, if N(i) is large (say N(i) = n — 1), since typically 
f/g< 1). Throughout the remainder of our discussion, we assume this cutoff principle 
is implicitly present when employing Lem. QT] 
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A Technical proofs for Section |3] 

Expanded Notation. We now expand on our previous notation for analyzing Eqn. © in order to 
facilitate proofs of the claims in Sec. [3] First, to recursively analyze a clause Hi lt „j k C H((C d )® k ), let 
Hh G H((C d )® fc ) for any 1 < b < k denote the action of H ;i/ !(c restricted to the first b of its k target 

qubits, i.e. H h := zf b ,..., h rf^ffh ® ' • • ® For example, H x = rjj;;;; fa h and H k = H h k . 

Note that implicitly depends on variables i\, . . . , ik,jb+l> ■ ■ ■ ,jh but to reduce clutter, our notation 
does not explicitly denote this dependence unless necessary. Next, to recursively analyze a degree- 
a inner product, we define t„j, : H(C d ) xn M- K for any < a < k and 1 < b < k such that 

t a ,b(Pl' ■ ■ ■ rPn) '■= Yu a ,...,h ^ r (tfb ' ik Ph ® " " " ® Pii) (where setting a = eliminates the sum over 
indices i). For example, tty is our full "degree-A:" objective function in Eqn. ©, and more generally, 
is the degree-b inner product in Eqn. ([8]). Allowing different values for a and b greatly eases our technical 
analysis. We use the shorthand f j, to denote t^j,, and again only explicitly denote the dependence of f B/ j, 
on parameters i a +i, ■ • . ,ik an d /b+l/ •••//* when necessary. 

Lemma 18. Let {pi}" =1 Q H(C d ). For any MAX-k-local Hamiltonian instance {Hi v ._j k j C H(C ) 
w;Y/i decomposition for the j. as given in Eqn. ((Sj), we have for any < a < k and 1 < b < k that 

\ta,b{p\,---,pn) \ < (maxi 6 h || piJIj, • • • || p h || F ) d\rf. 

Proof of Lem. [7#] By the triangle inequality and the Holder inequality for Schatten p-norms, we have 



'■a.b | 



E Tr(Hbpi b ®---®pi 

i a ,-,h 



< E \\ H bh\\pk®---®Ph || F 

i a ,...,ii 

< ( max || p 4 || F - ■■ || p fl || p ) E H H bllF' 



where we have used the fact that || A ® B || F = || A || F || B || p for all A, B G L(C d ). If we can now 
show that || Hj, ||p < || || F for all 1 < b < k, then we would be done since we would have 

i-lln. 



;>1 



Hi 



b Hf 



< ||H)t|| F n' 7 < din", where II Hjt || F < since II Hfc 



deed, we claim that for any fixed 1 < fe < fc, we have || Hb |L < 2 2 || Hfc 
straightforward expansion of the Frobenius norm and the fact that Tr(cr;Cy) = 2Sjj that 



< 1 by definition. In- 
|p. To see this, note by 



H, 



fc Hf 



Tr(H,2) 



jb,-,jk 



r l\,...,lk 



b-k 

2~ 



k 
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where x ll '— Ak is the coordinate vector of H^j, from Eqn. ([8]). By the second equality in the chain 



above, we see that in fact || H t 



k Hf 



rh,—Ak 



, , completing the proof of our claim. 



□ 
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Proof of Lent. 1731 We first derive the error bound of €},, and subsequently prove the probability bound. 
Wefollow MAKK991 . and proceed by induction on b. For the base case b = 1, EVAL(Hi, S, {pi : i G S}) 
attempts to estimate 



h(pi,...,p n ) = £ 



using our flawed sample points {pj : / 6 S}. To analyze the error of its output, assume first that our 
sample points are exact, i.e. pi = Pl for all i G S. Then, by setting "a" in Lem [TT1 to for i = i\, 

and by using Lem [T8l with parameters a = and b = 1 to obtain upper bound M = &~-, we have by the 
Sampling Lemma that (with probability at least 1 — n~f) 



h<=s 



k / f 

G t 1 (p 1 ,,.,,p n )±d^J-n. 

V 



(10) 



This bound holds if we sum over exact sample points. If we instead sum over flawed sample points 
{pi : z G S}, the additional error is bounded by A times 



E 

heS 



<E Li::!p^h(ph-Ph)) < E(lla-aM 3 )<^ 

i^es /i lies 

(11) 

where the second inequality uses Lem. [18] with parameters a = and b = 1 and the promise of our 
<5-net. We conclude for the base case that 



EVAL(Hi, S, {pj : i G S}) 



E 



G h(px,...,p n ) id? [ W-4 



as desired. 

Assume now that the inductive hypothesis holds for 1 < m < b — 1. We prove the claim for 
m = b. To do so, suppose first that the recursive calls on line 1(b) of Alg. [121 return the exact values 
of t l l_Ap\, . . . ,p n ), and that we have exact samples {p, : i G S}. Then, since by calling Lem. [T8l with 

a = b — 1 we have 5^yTr(cy(0i)^_ 1 (j0i, . . . ,p n ) < d^n b ^ 1 , it follows by the Sampling Lemma that 



T5iE 



ieS 



E Tr ( (r /Pi) t ?-i(pl/---/Pn) 



^E 

i=i 



E Tr (^)4-i(/ , i'---'P») 



±diJ^n D . (12) 



To first adjust for using flawed samples, observe that an analogous calculation to Eqn. (TTTTt yields 
]§T Eies Tr(cy(|0t — |6/)) < d^5n b , where we have called Lem. [T8l with a = b — 1. Thus, using 
flawed samples, the output of Alg. [T2l satisfies 



l D l ieS 



E Tr W0#-i 



^E 

/-i 



E Tr (^')^-i 



(13) 



To next drop the assumption that our estimates eu on line 1(b) are exact, apply the induction hypothesis 
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to conclude that e« G t^_^{p\, . . . ,p n ) ± e b _\V b 1 . Then, 



is] 



E 



E Tr W)««i 



G 



C 



C 



E 

ieS 

E 

ieS 

E 



where the last statement follows since 



d 2 

E Tr W-) 

7=1 



E Tr h 7 ) E rm(rm 

7=1 V \«=1 



E^W/) K-i ±e &-i" 



6-1 



ETr(o- ; p f )4 / _ 1 
E Tr WK-i 



± 



^6-1" 



E 



d 2 

E 

7=1 



±e b - 1 V2d(l + 5)n b , 



d 2 



< 2 |f w | <2d||r|| 2 < \Z2d(l + £), 



m=l 



(14) 



(15) 



where f denotes the coordinate vector of pi with respect to basis {o~ m }, and we have used the facts that 
Tr((7iC7y) = 26jj, that || x \\ t < \fd || x || 2 for x G C d , that || p t || F = \/2 || r || 2 for any p t G H(C d ), 
and that |j p, • || F < 1 + <5 (which follows from our <5-net and the triangle inequality). Thus, recalling that 
A = \^2d(l + 5) and substituting Eqn. (PT3T ) into Eqn. (fT4l . we have that 



;/ 

|S| 



E 



E Tr ( t7 7'^) e i J 



G t h (px,...,p n ) ± 




+ <5 +e fc _iA 




We hence have the recurrence relation < c?z ( « / 1 + <5 ) + e^-i A, which when unrolled yields 



e b < d 2 



as desired. This concludes the proof of the error bound. 

To prove the probability bound, we instead prove the stronger bound of 1 — [JJ m=0 

by induction on b. The base case b = 1 follows directly from our application of the Sampling Lemma 
in Eqn. (flOl) . For the inductive step, define for brevity of notation 7 := d 2 n, and apply the induction 
hypothesis to line 1(b) of Alg. [12] to conclude that each of the 7 calls to EVAL fails will probability 
at most (Y^n^o 7 m ) n ^- Then, by the union bound, the probability that at least one call fails is at 
most (X^jl\ "y m )n^f . Similarly, since our application of the Sampling Lemma in line 2 of Alg. [L? 



-6-1 



d 2 " 



-f 



fails with probability at most n f , we arrive at our claimed stronger bound of 1 — ( Y^ m =o 7™ ) n ^ > as 



desired. 



□ 



Proof of Lent. 1751 We begin by observing that if one sets e = e^, then the value of e' in line 2(b) of 
Alg. HH is precisely e^-i, and more generally, the e passed into the recursive call of line 2(e) on t b 
for any 1 < b < k is e b . Now, focus on some recursive call on t\, for b > 1 (the case of b = 1 is 
straightforward by Lem. [T3T >. If the estimates e« in line 2(a) succeed, then by Lem. \13\ we know that 

ejj G t'l^pi, . . . , p n ) ±e b _ 1 n h ~ 1 , implying ?l_ x {p\, ■ ■ ■ , p n ) G Now, /,-,- and M,y are only 

incorporated into linear constraints in recursive calls on tl_ v yielding constraints of the form 



(16) 
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But {pi, . . . ,p n } must now satisfy this constraint, since recall 

tb-l(pi,...,Pn)= E Tr K-iPi»-i) t B-2*~ 1 (Pl/"-/Pn)/ 

h-l,jb-l 

and there are d 2 n terms ej b l j b l in Eqn. (1161 ) each yielding an additional error of at most 6b— 2^1 
(assuming EVAL succeeded on f?"! in line 2(a)) above and beyond the bounds tl_Ap\, . . . ,p n ) G 
[hit u ij] we established above. 

We conclude that if, for all b, i, and /, EVAL succeeds in producing estimates ej , then {pi, . . . ,p n } 
is a feasible solution for Vi, as desired. The probability of this happening is, by the proof of Le m.[l3l at 
least 1 — d 2 n f, since EVAL recursively estimates precisely the same terms during its execution 10 ! □ 

Proof of Lent. [76] We begin by proving that for any recursive call to LINEARIZE on ty with valid upper 
and lower bounds U and L (i.e. U,L ^ oo), respectively, we have for any feasible solution (pi 7 • • • >pn) 
to P 2 that 



h(pi,...,p n ) e [L, U] ± did + V2 



fe-i 



£ (v / 2d) 6 - 1 - m e J) 



m—l 



(17) 



We prove this by induction on b, following MAKK99I . For base case b = 1, the claim is trivial by 
line 1(b) of the algorithm. Now, assume by induction hypothesis that 



H-M> ■■■>Pn)£ [hjtUij] ± d(d + V2 



6-2 

E 



6-2-m, 



By substituting the values of Z« and u,y from line 2(c), we have 
t^px, ...,p n )e e tj ±ld(d + V2 



b-2 
m=l 



b-2-m, 



+ £b-i n 



6-1 



We conclude that 
hiPlt ■ ■ ■ t Pn) = 

C 



E Tr ( cr /^')d(Pi'---'P") 



c 
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E Tr (^') e y 
E Tr O)Pt) e y 
[L, U] ± e b -id 2 n h 
C [L, LZ] ± </(<* + \/2) 



d(d + \/2) 



6-2 
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\b—2—m, 



Vld I c 



d(d 



Vld yd(d + a/2) 
E(^) 6 - 1 - m em 



£(V2d) 

b J^(V2d) h - 2 - m 
m=l 

b-2 

E(^) 



+ e&-l 



E Tr (°/P'') 

ij 



e b -i n 



,6-1 



(18) 



6-2 

E< 

m—l 



b-2-m. 



e h -x n 



where the third statement follows from a calculation similar to Eqn. (fT5l) . and the fourth statement from 
line 3(b) of Alg.fPfl This proves the claim of Eqn. (TT71) . 



io n 



°This holds even though on line 1 of Alg.fTJ] we only estimate d 2 \ S\ of the terms (i.e. EVAL does not actually estimate 
all terms in the recursive decomposition of t^, as it does not need to) — this is because in our analysis of the probability bound 
for Alg.[T2] we actually produced a looser bound by assuming all n terms e,-y are estimated. 
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To complete the proof of Lem. [T6j observe that by Lem. [T5j the assignment p ^ is feasible for 
P2 with probability at least 1 — d 2k n k ~f. Thus, plugging jO opt into each of the d 2 n linear constraints 
produced by the recursive calls to LINEARIZE on each we have by Eqns. (TT71) and (fT8l ) that (with 
probability 1 - d 2k n k ~f) 



c 



^Tr(cr ; -p° pt )^ 



C OPT 2 ±d(d + y/2) 



± V2d yd(d + y/2) 

m=l 



it-2 

E 

m=l 



E(v / 2rf) 



k-2-m. 



where the last statement follows since p opt is not necessarily the optimal solution to P2. 



eit-i n 



□ 
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